"subcorpus" meaning in All languages combined

See subcorpus on Wiktionary

Noun [English]

Forms: subcorpora [plural]
Etymology: From sub- + corpus. Etymology templates: {{prefix|en|sub|corpus}} sub- + corpus Head templates: {{en-noun|subcorpora}} subcorpus (plural subcorpora)
  1. A subset of a corpus.

Inflected forms

{
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "sub",
        "3": "corpus"
      },
      "expansion": "sub- + corpus",
      "name": "prefix"
    }
  ],
  "etymology_text": "From sub- + corpus.",
  "forms": [
    {
      "form": "subcorpora",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "subcorpora"
      },
      "expansion": "subcorpus (plural subcorpora)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        {
          "kind": "other",
          "name": "English entries with incorrect language header",
          "parents": [
            "Entries with incorrect language header",
            "Entry maintenance"
          ],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "English terms prefixed with sub-",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with 1 entry",
          "parents": [],
          "source": "w"
        },
        {
          "kind": "other",
          "name": "Pages with entries",
          "parents": [],
          "source": "w"
        }
      ],
      "examples": [
        {
          "ref": "2018, Clarence Green, James Lambert, “Advancing disciplinary literacy through English for academic purposes: Discipline-specific wordlists, collocations and word families for eight secondary subjects”, in Journal of English for Academic Purposes, volume 35, →DOI, page 110:",
          "text": "Thus the word react occurs 2331 times in the Chemistry subcorpus, reacts occurs 2195 times, etc., and adding all members together, the REACT family occurs 27,991 times throughout Chemistry.",
          "type": "quote"
        }
      ],
      "glosses": [
        "A subset of a corpus."
      ],
      "id": "en-subcorpus-en-noun-JgUU8WLj",
      "links": [
        [
          "subset",
          "subset"
        ],
        [
          "corpus",
          "corpus"
        ]
      ]
    }
  ],
  "word": "subcorpus"
}
{
  "etymology_templates": [
    {
      "args": {
        "1": "en",
        "2": "sub",
        "3": "corpus"
      },
      "expansion": "sub- + corpus",
      "name": "prefix"
    }
  ],
  "etymology_text": "From sub- + corpus.",
  "forms": [
    {
      "form": "subcorpora",
      "tags": [
        "plural"
      ]
    }
  ],
  "head_templates": [
    {
      "args": {
        "1": "subcorpora"
      },
      "expansion": "subcorpus (plural subcorpora)",
      "name": "en-noun"
    }
  ],
  "lang": "English",
  "lang_code": "en",
  "pos": "noun",
  "senses": [
    {
      "categories": [
        "English countable nouns",
        "English entries with incorrect language header",
        "English lemmas",
        "English nouns",
        "English nouns with irregular plurals",
        "English terms prefixed with sub-",
        "English terms with quotations",
        "Pages with 1 entry",
        "Pages with entries"
      ],
      "examples": [
        {
          "ref": "2018, Clarence Green, James Lambert, “Advancing disciplinary literacy through English for academic purposes: Discipline-specific wordlists, collocations and word families for eight secondary subjects”, in Journal of English for Academic Purposes, volume 35, →DOI, page 110:",
          "text": "Thus the word react occurs 2331 times in the Chemistry subcorpus, reacts occurs 2195 times, etc., and adding all members together, the REACT family occurs 27,991 times throughout Chemistry.",
          "type": "quote"
        }
      ],
      "glosses": [
        "A subset of a corpus."
      ],
      "links": [
        [
          "subset",
          "subset"
        ],
        [
          "corpus",
          "corpus"
        ]
      ]
    }
  ],
  "word": "subcorpus"
}

Download raw JSONL data for subcorpus meaning in All languages combined (1.3kB)


This page is a part of the kaikki.org machine-readable All languages combined dictionary. This dictionary is based on structured data extracted on 2025-01-25 from the enwiktionary dump dated 2025-01-20 using wiktextract (c15a5ce and 5c11237). The data shown on this site has been post-processed and various details (e.g., extra categories) removed, some information disambiguated, and additional data merged from other sources. See the raw data download page for the unprocessed wiktextract data.

If you use this data in academic research, please cite Tatu Ylonen: Wiktextract: Wiktionary as Machine-Readable Structured Data, Proceedings of the 13th Conference on Language Resources and Evaluation (LREC), pp. 1317-1325, Marseille, 20-25 June 2022. Linking to the relevant page(s) under https://kaikki.org would also be greatly appreciated.